MIT  LIBRARIES  DUPL 


3  9080  02246  1179 


B  '  Li'  .1  It.J.         \  1 


% 


,'fB-:';!  " 


ii;  ;;l-.:;iyil!;i;!«iaa 


M|j^jj^M^Jiliiii!!|jli||i 


il!H{l'l!?fiiH!!fi!!^!M 


w 

^j\m 

mm 

\m 

f      il! 

Hi 

|» 

mw 

t  ?! 

'Ill 

tm'^ 

HH) 

imi-ili 

irf'T  ' 

mw 


,.  Mil' 

liip 


iw 


i 


Wir:IWi!Hliliniili\i;:imiaiii:ii!il 


1  r^ 


i  itri 


11 


I  iff  I'l 


I 


Ml 


4 


mm\>n> 


Wdh 


,?rg 


illBii 


,j     Iji'il'iifJUi's''  Wp'i     |i  ii!l(i  ii 

llPiMliiii.    ' 

111       !   ''  'f  1'  I  f^ri ''     1 1 


^Mjli.! ,  I, 


Digitized  by  the  Internet  Archive 

in  2011  with  funding  from 

Boston  Library  Consortium  Member  Libraries 


http://www.archive.org/details/evolutionofperceOOacem 


B31 
^415 

36 


CEWEY 


Massachusetts  Institute  of  Technology 

Department  of  Economics 

Working  Paper  Series 


EVOLUTION  OF  PERCEPTIONS  AND  PLAY 


Daron  Acemoglu 
Muhamet  Yiidiz 


Working  Paper  01 -36 
June  2001 


RoomE52-251 

50  Memorial  Drive 

Cambridge,  MA  021 42 


This  paper  can  be  downloaded  without  charge  from  the 

Social  Science  Research  Network  Paper  Collection  at 

http://papers.ssm.com/abstract=^290779 


Massachusetts  Institute  of  Technology 

Department  of  Economics 

Working  Paper  Series 


EVOLUTION  OF  PERCEPTIONS  AND  PLAY 


Daron  Acemoglu 
Muhamet  Yildiz 


Working  Paper  01  -36 
June  2001 


Room  E52-251 

50  Memorial  Drive 

Cambridge,  MA  02142 


This  paper  can  be  downloaded  without  charge  from  the 

Social  Science  Research  Network  Paper  Collection  at 

httpi/Zpapers-Ssm-coni/abstract^xxxxx 


"MASSAciJypfnwsTm^ 

OFTECHMOLOGY 


Evolution  of  Perceptions  and  Play^ 

Daron  Acemoglu  Muhamet  Yildiz 

MIT  MIT 

June  2001 


Abstract 

^.  An  agent  with  a  misperception  may  have  an  evokitionary  advantage  when  his 
misperception  and  its  behavioral  impUcations  are  recognized  by  others.  In  such 
situations  evolutionary  forces  can  lead  to  misperceptions,  yielding  "irrational  be- 
havior," such  as  the  play  of  strictly  dominated  strategies.  We  point  out  that 
this  reasoning  relies  on  the  assumption  of  subjective  rationality — agents  are  as- 
sumed to  choose  the  behavior  that  maximizes  their  perceived  payoffs.  However, 
subjective  rationality  does  not  have  solid  evolutionary  foundations:  in  the  pres- 
ence of  misperceptions,  agents  who  do  not  maximize  their  perceived  payoffs  may 
have  greater  fitness  than  those  who  do.  We  show  that  relaxing  the  subjective 
rationality  requirement,  somewhat  paradoxically,  leads  to  effectively  rational  be- 
havior: although  agents  may  have  systematic  misperceptions,  they  will  develop 
other  biases  to  undo  these  misperceptions,  and  will  act  as  if  they  are  rational.  As 
a  result,  systematic  biases  in  experimental  settings  may  not  necessarily  translate 
into  irrational  behavior.  We  also  demonstrate  that  the  same  evolutionary  forces, 
in  the  long  run,  lead  agents  to  play  as  if  they  have  a  common  prior,  even  though 
each  agent  will  have  different  and  possibly  incorrect  perceptions  of  payoffs  and  the 
rules  of  the  game. 

Keywords:  Common  prior,  evolution,  neutral  stability,  misperceptions,  per- 
ceptions, rationality. 

JEL  Classification:  B40,  C72,  D84. 
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1     Introduction 

Much  of  economics  is  built  on  the  notion  that  agents  will  take  actions  that  maximize 
their  payoff  or  fitness.  Alchian  (1950),  Friedman  (1953),  and  Becker  (1962)  have  all 
emphasized  how  "evolutionary"  competition,  either  through  the  survival  of  the  fittest  or 
social  evolution,  will  tend  to  eliminate  agents  who  deviate  substantially  from  maximiza- 
tion. The  evolutionary  game  theory  hterature  formalizes  these  insights.  Under"  rephcator 
dynamics,  evolutionary  forces  will  typically  eliminate  all  dominated  strategies,  and  any 
rest  point  of  an  evolutionary  process  will  correspond  to  a  Nash  equilibrium  of  an  under- 
lying game  (see,  e.g.,  the  recent  textbooks  on  evolutionary  game  theory,  WeibuU,  1997, 
or  Samuelson,  1997).^ 

The  psychology  literature  and  recent  research  in  behavioral  economics,  instead,  em- 
phasize a  variety  of  biases  and  misperceptions  (e.g.,  over  confidence,  wishful  thinking, 
law  of  small  numbers).  Although  some  of  these  "nonrational"  types  of  behavior  may 
result  from  cognitive  hmitations,  many  scholars  argue  that  nonrational  behavior  can 
arise  because  it  confers  no  payoff  disadvantage  and  may  even  have  payoff  benefits.  First, 
there  is  an  informal  folk  theorem  that,  given  the  complexity  of  the  econoinic  situations 
that  agents  are  involved  in  and  the  speed  with  which  evolution  works,  there  will  not  be 
strong  enough  evolutionary  forces  to  wipe  out  nonrational  behavior  (See  MuUainathan 
and  Thaler,  2000,  for  a  discussion  and  justification  for  this  view).  Second,  many  econo- 
mists have  argued  that  certain  types  of  behavior  that  appear  "irrational"  may  be  useful 
commitment  devices.  Robert  Frank  (1988),  for  example,  points  out  how  the  irrational 
tendency  to  seek  revenge  may  be  useful  by  acting  as  a  commitment  to  punish  those  who 
break  their  promises.  Robson  (1990),  Banerjee  and  Weibull  (1993),  Dekel,  Ely  and  Yi- 
lankaya  (1998),  and  Kockesen  et  al  (1999)  similarly  show  how,  when  agents'  preferences 
(types)  are  observable,  evolution  may  select  preferences  or  types  that  act  "irrationally" 
to  gain  commitment  advantage. 

The  argument  that  irrational  behavior  may  be  a  useful  commitment  device  requires 
agents'  preferences  to  be  (to  some  degree)  observable.  In  fact,  a  number  of  contribu- 
tions, including  Dekel,  Ely  and  Yilankaya  (1998),  Ely  and  Yilankaya  (1999)  and  Ok 
and  Vega-Rodondo  (2000)  show  that  when  preferences  are  not  observed  at  all,  evolution 

'  Other  (non-monotonic)  evolutionary  dynamics  may  lead  to  strategies  that  are  not  Nash  equilibria 
(and  are  even  dominated)  in  the  underlying  game.  See  Bendor  et.  al.  (2001)  and  Stenneck  (2000). 


takes  us  back  to  Nash  equilibria.  All  of  these  papers,  irrespective  of  whether  they  allow 
preferences  to  be  observed  or  unobserved,  rely  on  a  strong  notion  of  subjective  ratio- 
nality: agents  are  assumed  to  always  maximize  their  perceived  utility  or  payoff.  Other 
contributions  in  game  theory  also  start  from  different  perceptions  and  use  a  notion  of 
subjective  rationality  to  derive  tight  predictions  (e.g.,  Harrison  and  Kreps,  1978,  Morris, 
1996,  Van  den  Steen,  2001,  Yildiz,  2000). 

In  this  paper,  we  argue  that  subjective  rationality  does  not  have  strong  evolutionary 
foundations  (see  also  Blume  and  Easley,  1992).  Subjectively  irrational  agents — who  sys- 
tematically choose  strategies  that  do  not  maximize  their  perceived  payoffs — may  obtain 
higher  payoffs  than  subjectively  rational  agents.  Somewhat  paradoxically,  once  we  allow 
mutations  (deviations)  that  relax  subjective  rationality,  there  are  strong  forces  pushing 
agents  towards  effectively  rational  behavior  (with  limited  information)."  Our  main  re- 
sults can  be  summarized  as:  allowing  subjective  irrationality  leads  to  effectively  rational 
behavior  (with  limited  information)  in  any  neutrally  stable  outcome  of  evolution.'^ 

Nevertheless,  we  find  that  there  is  no  immediate  link  between  rational  behavior  and 
accurate  perceptions.  Evolution  will  select  agents  who  will  act  "effectively  rationally", 
but  these  agents  will  have  very  different  perceptions,  and  sometimes  will  make  systematic 
mistakes  (yet  their  mistakes  will  cancel  each  other).  In  this  sense,  our  analysis  provides  a 
microfoundation  for  the  connnon  premise  that  agents  will  typically  behave  as  if  they  are 
rational  (e.g.,  Friedman,  1953).  This  reasoning  suggests  that  systematic  misperceptions 
in  experimental  settings  or  in  situations  with  little  relevance  to  long-run  fitness  do  not 
necessarily  translate  into  widespread  "irrational"  behavior. 

We  also  show  that  in  any  neutrally  stable  outcome  of  evolution,  agents  may  have 
different  perceptions,  but  the  play  will  correspond  to  a  Nash  equilibrium  outcome  of 
a  game  where  players  have  a  common  prior.''    This  result  is  of  interest  since  there  is 


^We  call  an  agent  as  effectively  rational  with  limited  information  iff  he  maximizes  his  expected  fitness 
with  respect  to  his  information.  Such  an  agent  may  lack  some  information,  but  will  never  have  any 
bias  nor  play  dominated  strategies  given  his  information  set.  (A  stronger  notion  of  rationality  will 
correspond  to  eiiectively  rational  with  full  information.)  We  refer  to  this  type  of  behavior  as  "effectively 
rational"  since  agents,  despite  their  misperceptions,  act  as  if  they  are  rational.  Alternatively,  this 
type  of  behavior  could  be  referred  to  as  "objectively  rational"  to  contrast  it  with  subjectively  rational 
behavior.  To  minimize  terminology,  we  stick  to  the  term  "effectively  rational" . 

■'Neutral  stability  is  a  weaker  form  of  evolutionary  stability,  where  stable  strategies  do  not  need  to 
perform  strictly  better  than  all  mutations  in  post-entry  population. 

""But  they  may  not  be  able  to  distinguish  some  underlying  games  from  each  other.  This  common 
prior  will  simply  correspond  to  the  frequencies  with  which  these  underlying  games  are  played. 


disagreement  about  the  plausibility  of  the  common  prior  assumption  in  game  theory  (see 
for  example,  Gul,  1998,  and  Aumann,  1998).  Since  the  common-prior  assumption  is  a 
crucial  epistemological  foundation  for  equilibrium  behavior  in  most  games,  the  criticisms 
regarding  the  common  prior  assumption  have  important  implications  for  all  equilibrium 
analyses  (see  Aumann,  1987,  and  Aumann  and  Brandenburger,  1995,  for  the  importance 
of  the  common  prior  assumption,  and  Dekel  and  Gul,  1997,  for  the  related  criticism  of 
equilibrium  analysis).  Our  results  suggest  that  equilibrium  analysis  with  the  common- 
prior  assumption  may  have  good  evolutionary  foundations — even  when  players'  expressed 
beliefs  are  incompatible  with  each  other. 

It  is  important  to  note  that  the  neutrally  stable  behavior  will  be  effectively  rational 
only  with  limited  information;  some  misperceptions  may  remain  in  the  form  of  "igno- 
rance" ,  so  agents  will  not  necessarily  play  a  Nash  equilibrium  outcome  of  the  underlying 
full-information  game.  When  perceptions  are  observable,  agents  may  develop  misper- 
ceptions in  order  to  suppress  information  that  would  otherwise  reduce  the  payoff  to  all 
parties.^ 

The  rest  of  the  paper  is  organized  as  follows.  In  the  next  section,  we  give  a  number 
of  examples  that  illustrate  our  results,  and  also  discuss  related  literature  in  more  detail. 
Section  3  analyzes  evolution  in  a  single-agent  decision  problem  (without  strategic  inter- 
actions) more  formally.  Section  4  analyzes  a  multi-agent/game-theoretic  situation  and 
contains  the  main  results  of  the  paper.  Since  our  main  point  is  that  effectively  rational 
behavior  emerges  once  we  allow  agents  to  be  subjectively  irrational,  in  this  section,  we 
allow  perceptions  to  be  observed,  creating  the  most  favorable  environment  for  deviations 
from  rational  behavior  (see,  e.g.,  Dekel,  Ely,  and  Yilankaya,  1998).  Section  5  analyzes 
evolution  when  perceptions  are  not  observed.  Section  6  concludes. 

^This  result  is  obtained  because  perceptions  are  more  than  "cheap  talk":  they  also  determine  how 
individuals  process  data  from  the  real  world.  A  player  who  perceives  two  different  situations  in  exactly 
the  same  way  has  to  play  the  same  strategy  in  both  situations.  We  show  below  that  when  perceptions 
are  simply  "cheap  talk"  systematic  misperceptions  that  affect  behavior  cannot  arise. 
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2     Examples,  Outline  and  Related  Literature 

2.1      Examples  and  Outline 

Consider  the  following  2x2  game  and  interpret  the  payoffs  as  fitness  levels,  so  that 
evolutionary  dynamics  will  favor  strategies  that  have  higher  payoffs: 

Example  of  underlying  game      , 

(Game  1): 

r 

This  game  is  dominance  solvable  and  hence  has  a  unique  Nash  equilibrium  in  which 
player  1  chooses  r  and  player  2  plays  L.  Now  imagine  a  meta-game  in  which  players  have 
different  perceptions  of  their  payoffs — they  may  see  the  payoff  matrix  differently —  and 
an  evolutionary  process  determining  the  frequency  of  different  perceptions.  The  fitness 
of  different  perceptions  in  this  evolutionary  meta-game  are  still  given  by  the  payoffs  in 
the  underlying  game.  Moreover,  let  us  make  two  important  assumptions: 

1.  observability:  agents'  perceptions  are  observable  to  others  in  the  game  (for  ex- 
ample, the  perception  of  the  payoff  matrix  that  one  holds  is  engraved  on  their 
forehead).^ 

2.  subjective  rationality:  players  choose  whichever  action  maximizes  their  perceived 
payoffs. 


Then,  consider  a  type  for  player  1  who  perceives  this  game  instead  as 

Misperceiv 
(Game  2): 


1\2     L       R 
Misperceived  game      , 
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r 

This  type  mistakenly  perceives  that  I  is  a  dominant  strategy,  and  given  subjective 
rationality,  will  choose  I  over  r.  This  will  encourage  his  opponent  to  play  R,  and  so 
a  player  with  this  type  of  misperception  will  receive  the  "true  payoff"  of  2.  This  mis- 
perception  benefits  player  1  because  it  enables  him  to  commit  to  playing  his  dominated 
strategy  in  the  underlying  game,  thus  enticing  player  2  to  change  his  play.  Without  the 

"^See  Frank  (1988)  for  a  defense  of  the  view  that  perceptions  and  preferences  are,  at  least  to  some 
degree,  observable  to  others. 
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misperception,  player  1  could  not  have  committed  to  playing  his  dominated  strategy, 
and  player  2  would  have  preferred  to  respond  by  L. 

To  see  the  forces  shaping  the  evolution  of  perceptions,  we  simply  apply  standard 
evolutionary  reasoning  to  a  meta-game  where  we  allow  player  1  to  either  have  the  correct 
perception  of  Game  1  or  misperceive  it  as  Game  2  (in  other  words,  we  allow  mutations 
over  accurate  and  inaccurate  perceptions).  For  simplicity,  we  allow  no  misperceptions 
for  player  2.  To  highlight  the  choices  facing  player  2,  we  also  write  out  the  four  strategies 
for  player  2,  even  though  there  is  no  evolutionary  selection  over  these  actions  (instead, 
player  2  chooses  his  action  optimally  after  observing  the  perceptions  of  player  1).  Since 
players  are  subjectively  rational  given  their  perceptions,  the  payoffs  in  this  meta-game 
are 

1\2  LL     LR     RL     RR 

Meta-game  ,  , .       

^   ,  ,   , .         „  , .  accurate  perception 

oi  the  evolution  oi  perceptions: 

misperception 

Strategy  LL  for  player  2  corresponds  to  choosing  L  when  player  1  is  observed  to  have 
accurate  perceptions  and  also  when  he  is  observed  to  have  misperceptions.  Strategy 
LR  corresponds  to  choosing  L  when  player  1  has  accurate  perceptions,  but  R  when  he 
has  misperceptions.  The  payoffs  then  follow  immediately  from  Game  1,  incorporating 
the  fact  that  when  he  has  accurate  perceptions,  player  1  will  choose  r,  and  when  he 
misperceives  the  payoffs,  he  will  choose  /. 

The  best  play  for  player  1  is  clearly  to  develop  a  misperception,  inducing  him  to  play  a 
dominated  strategy,  I,  in  the  underlying  game.  The  best  response  of  player  2  is  to  choose 
LR.  This  generates  the  payoff  (2,2),  which  is  obviously  not  a  Nash  equilibrium  outcome 
of  the  underlying  game  (and  it  contains  the  play  of  a  dominated  strategy  along  the 
equihbrium  path).  Replicator  or  similar  monotonic  dynamics  will  therefore  encourage 
the  development  of  misperceptions. 

Intuitively,  when  player  2  sees  an  opponent  with  accurate  perceptions,  he  will  choose 
L,  anticipating  that  his  opponent  will  choose  r,  giving  both  players  (1,1).  In  contrast, 
when  faced  with  a  player  with  a  misperception,  he  will  correctly  reason  that  his  oppo- 
nent will  play  I,  and  he  will  choose  R,  leading  to  (2,2).  This  example  therefore  suggests 
that  evolutionary  pressures,  far  from  wiping  out  misperceptions,  may  fa\'or  apparently 
"nonrational"  behavior.  Specifically,  two  types  of  nonrational  behavior  arise  as  air  out- 
come of  the  evolution  of  perceptions  in  this  case:    first,  agents  may  have  systematic 


misperceptions;  second,  these  misperceptions  lead  to  the  play  of  strictly  dominated  (and 
hence  non-equilibrium)  strategies  in  the  underlying  game. 

The  role  of  the  observability  assumption  above  is  clear:  if  the  misperception  of  player 
1  were  not  observed,  it  could  not  act  as  a  commitment  device.  Specifically,  a  mutant 
(deviation)  without  misperception  would  choose  r  and  receive  the  higher  payoff  of  3  at 
the  expense  of  player  2. 

The  subjective  rationality  assumption  is  equally  important:  it  guarantees  that  the 
misperception  of  player  1  will  translate  into  a  specific  action,  encouraging  player  2  to 
change  his  behavior. '^  Is  subjective  rationality  reasonable  in  an  evolutionary  setting? 
Consider  the  choice  between  two  options,  A  and  B,  where  A  has  much  higher  payoff  (in 
the  extreme  case,  agents  who  choose  B  die).  But,  suppose  that  all  agents  have  developed 
a  misperception  and  think  that  B  is  preferable  to  A.  Now  imagine  selection  over  two 
different  types,  one  that  is  subjectively  rational  and  the  other  subjectively  irrational 
(in  the  sense  that  he  will  not  necessarily  play  the  actions  that  maximize  his  perceived, 
payoffs — in  fact,  here  we  take  the  subjectively  irrational  type  to  play  the  action  that 
minimizes  his  payoff).  The  subjectively  rational  type  will  choose  B  since  he  perceives 
it  to  be  preferable  to  A,  while  the  subjectively  irrational  type  will  choose  A  despite  the 
fact  that  he  perceives  B  to  yield  a  higher  payoff.  Evolution  will  favor  the  subjectively 
irrational  type,  choosing  A.  In  fact,  a  subjectively  irrational  type  with  this  misperception 
will  do  as  well  as  a  subjectively  rational  agent  with  accurate  perceptions,  who  perceives 
A  to  yield  higher  payoff  and  chooses  it.  This  simple  example  demonstrates  that,  in  the 
presence  of  misperceptions,  there  are  no  compelling  reasons  to  expect  evolution  to  favor 
subjectively  rational  agents. 

Let  us  now  return  to  the  analysis  of  Game  1  and  allow  players  not  only  to  misper- 
ceive  payoffs,  but  also  to  deviate  from  subjective  rationality  (for  example,  mutations 
that  change  how  a  given  perception  or  plan  is  mapped  into  behavior).  Our  major  result 
(Proposition  5)  states  that  in  this  case  where  we  allow  subjective  irrationality,  evolution- 
ary forces  will  take  us  towards  "effectively  rational  play" — i.e.,  towards  behavior  that  is 

^To  many  economists  and  game  theorists,  subjective  rationality  is  natural,  almost  tautological: 
preferences,  through  the  axiom  of  revealed  preferences,  simply  represent  choices.  But  in  an  evolutionary 
model  where  preferences  (perceptions)  can  be  observable  at  the  beginning  of  the  game,  this  revealed- 
preference  notion  does  not  apply,  since  we  cannot  possibly  observe  the  representation  of  the  choice 
before  the  choice  has  been  made. 
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indistinguishable  from  that  resulting  from  rational  play  and  does  not  feature  dominated 
strategies  given  the  information  set. 

To  better  understand  this  result,  let  us  now  return  to  the  above  meta-game  and  allow 
4  different  "strategies"  for  player  1:  accurate  perception  and  subjective  rationality;  mis- 
perception  and  subjective  rationality;  accurate  perception  and  subjective  irrationaUty; 
and  misperception  and  subjective  irrationahty.  Here  by  misperception  we  mean  that 
player  1  perceives  Game  2  when  the  true  game  is  Game  1,  while  by  subjective  irrational- 
ity we  simply  mean  player  1  playing  the  dominated  strategy  in  the  perceived  game.  We 
allow  no  misperceptions  for  player  2. 

Meta-game  of  the  evolution 

of  perceptions  and  play: 

1/2        LL     LR    RL     RR 
accurate  perception  and  subjective  rationality 
misperception  and  subjective  rationality 
accurate  perception  and  subjective  irrationality 
misperception  and  subjective  irrationality 

Again  the  first  letter  for  player  2  denotes  his  action  when  he  observes  player  1  with 
accurate  perception,  and  the  second  letter  gives  his  action  following  a  misperception  by 
player.  For  example,  if  player  1  has  misperceptions  and  is  subjectively  rational  (row  2), 
he  will  play  I,  so  when  player  2  chooses  L,  the  outcome  is  (0,0)  and  when  he  chooses 
i?,  the  outcome  is  (2,2).  In  contrast,  if  player  1  has  a  misperception  and  is  subjectively 
irrational  (row  4),  he  perceives  I  to  give  higher  payoff,  but  still  plays  r.  Now  if  player  2 
chooses  L,  the  outcome  is  (1,1),  and  if  he  chooses  R,  the  outcome  is  (3,0). 

In  this  meta-game,  the  outcome  (2,2)  is  no  longer  (evolutionarily  or  neutrally)  stable. 
To  see  this,  first  consider  the  case  that  player  1  misperceives  the  payoffs  and  is  subjec- 
tively rational  and  player  2  plays  LR  (or  RR),  leading  to  (2,2).  Imagine  a  mutation 
of  player  1  that  leads  to  subjective  irrationality,  while  still  keeping  the  misperception. 
Recall  that  whether  player  1  has  accurate  perceptions  is  seen  by  player  2,  but  whether 
he  is  subjectively  rational  or  not  is  not  observed.  Therefore,  with  both  LR  and  RR, 
player  2  will  continue  to  play  R,  but  the  mutant  player  1  will  respond  with  r  and  obtain 
the  higher  true  payoff  3,  taking  the  system  away  from  (2,2).  Similarly,  the  incumbent 
population  in  which  player  1  has  accurate  perceptions  but  behaviorally  irrational  and 
player  2  plays  RL  (or  RR)  is  invaded  by  the  behaviorally  rational  mutants  with  ac- 
curate perceptions.   It  is  therefore  straightforward  to  verify  that  any  rest  point  of  our 


evolutionary  process  will  yield  the  outcome  (1,1) — the  Nash  equihbrium  outcome  of  the 
underlying  game.  This  can  be  achieved  as  a  combination  of  player  1  developing  an  ac- 
curate perception  and  subjective  rationality  and  player  2  responding  with  LL,  or  player 
1  developing  a  misperception  and  subjective  irrationality  and  player  2  again  responding 
with  LL. 

In  essence,  when  agents  are  hmited  to  subjective  rationality,  misperceptions  provide 
commitment.  However,  once  we  allow  the  evolution  of  "play"  (how  perceptions  are 
mapped  into  behavior)  as  well  as  perceptions,  players  can  undo  their  misperceptions  by 
modifying  their  degree  of  subjective  rationality.  This  destroys  the  commitment  value 
of  misperceptions.  The  crucial  difference  here  is  between  the  observability  of  misper- 
ceptions and  the  non-observabihty  of  play.  As  is  standard  in  game  theory,  we  do  not 
(and  cannot)  allow  strategies  to  depend  on  the  strategies  of  other  players.  So  wliile 
which  action  a  player  chooses  can  depend  on  his  and  his  opponents'  perceptions  and 
past  actions,  it  cannot  depend  on  the  exact  mapping  that  his  opponents  use  to  translate 
these  perceptions  into  actions  (e.g.,  it  cannot  depend  on  their  actions  in  the  future  or 
the  actions  they  could  have  taken  in  other  situations). 

The  sense  in  which  the  resulting  behavior  is  "rational"  needs  some  elaboration.  Evo- 
lutionary forces  take  us  back  to  equihbrium  outcomes  of  the  underlying  game,  but  this 
could  be  supported  either  by  accurate  perceptions  and  subjective  rationality,  or  by  mis- 
perceptions and  subjective  irrationality.  This  explains  our  choice  of  terminology:  "effec- 
tively" rational  behavior  may  be  a  combination  of  inaccurate  perceptions  and  subjective 
irrationality.  So,  in  practice  we  may  observe  agents  with  systematic  biases  in  how  they 
perceive  the  world  or  how  they  make  decisions.  But  these  biases  will  be  undone  by  other 
biases  they  develop  and  will  not  shape  their  important  decisions  (their  decisions  affecting 
long-run  fitness).  The  notion  of  "rational  behavior"  that  emerges  is  therefore  weaker 
than  the  usual  definition  of  rationality  as  it  allows  misperceptions.^ 

Yet  agents  are  also  rational  in  the  stronger  sense  of  effectively  having  a  common 

prior.  Although  agents  may  appear  to  have  systematic  misperceptions,  viewing  the  same 

situation  in  different  ways,  they  will  behave  as  if  they  all  share  the  same  beliefs.   For 

example,  suppose  a  certain  behavior,  say  risk-taking,  has  a  fitness  benefit.  Some  agents 

^It  is  also  weaker  than  the  usual  definition  because  it  requires  behavior  to  be  effectively  rational 
with  limited  information — that  is,  rational  given  the  information  set  of  agents,  which  may  not  feature 
full  information.  See  below. 


may  view  this  behavior  as  "too  risky" ,  while  others  view  it  as  "fun"  and  well  worth  the 
risk.  But  at  the  end,  both  groups  will  undertake  the  activity  as  if  they  had  the  same 
beliefs  (and  preferences).  In  this  sense,  our  analysis  also  provides  a  foundation  for  the 
"common  prior"  assumption:  even  though  agents  may  have  different  perceptions,  they 
will  act  as  if  they  have  effectively  common  priors. 

Consider  an  example  to  illustrate  this  point  further.  Suppose  that  each  of  two  agents 
have  to  take  one  of  three  actions,  a,  b  and  c,  and  there  are  three  possible  states  of  the 
world,  A,  B  and  C.  For  simphcity,  ignore  strategic  interactions  and  assume  that  each 
agent's  payoff  is  only  a  function  of  his  own  action  and  the  state  of  the  world.  Each  agent 
receives  a  payoff  of  1  when  the  action  matches  the  state  of  the  world,  and  a  payoff  of  0 
otherwise.  Clearly,  one  possible  evolutionary  equilibrium  involves  each  agent  recognizing 
the  underlying  state  of  the  world  correctly,  and  choosing  the  action  corresponding  to 
the  state  of  the  world  he  perceives  (e.g.,  a  if  the  state  of  the  world  is  A).  In  this 
case,  both  agents  will  share  the  same  perception.  However,  under  the  assumption  that 
each  agent's  perceptions  are  observable,  there  are  other  evolutionary  equilibria  where 
agents  will  have  different  (and  inaccurate)  perceptions,  but  act  as  if  they  have  common 
perceptions.  For  example,  suppose  that  both  agents  perceive  A  and  C  accurately,  but 
agent  1  misperceives  B  as  A,  and  agent  2  misperceives  B  as  C .  In  this  case,  neither 
agent  has  accurate  perceptions,  and  moreover,  their  perceptions  differ.  But  consider  the 
following  behavior  for  both  agents:  play  a  if  agent  2's  perception  is  A,  play  b  if  agent 
I's  perception  is  A  and  agent  2's  perception  is  C,  and  play  c  if  agent  I's  perception 
is  C.  This  perception-play  combination  will  give  the  same  evolutionary  fitness  as  the 
perception-play  combination  where  each  agent  recognizes  the  underlying  world  correctly 
and  responds  to  it.  Therefore,  in  this  simple  example,  it  is  possible  for  both  players  to 
have  different  (and  incorrect)  perceptions,  but  act  as  if  they  have  common  priors,  and 
obtain  the  maximum  payoffs. 

Finally,  it  is  useful  to  give  an  example  where  behavior  is  effective  rational  only  with 
limited  information,  i.e,  where  it  does  not  correspond  to  equilibrium  play  of  the  un- 
derlying full-information  game.  This  can  happen  because  of  suppression  of  infommtion 
by  all  players.  To  illustrate  this,  consider  once  again  a  simple  example.  Suppose  that 
players  1  and  2  play  the  following  game: 


Suppression  of  information 
(Game  3): 


1\2 


L 

R 

3,3 

-1,1 

w',-5 

0,0 

Here  if  is  a  random  variable,  which  takes  the  value  0  and  5,  each  with  probabihty 
1/2.  Both  players  observe  w  before  making  their  decisions.  It  is  clear  that  when  w  = 
5,  the  unique  Nash  equihbrium  outcome  is  (0,0),  while  when  w  —  0,  there  is  also  a 
Nash  equilibrium  with  players  receiving  (3,3).  Now  imagine  both  players  develop  the 
misperception  that  w  =  0  aB.  the  time.  Then,  the  (true)  payoffs  of  the  perceived  game 
become 


1\2     L 


R 


3,3 

-1,1 

2.5,-5 

0,0 

and  there  is  now  an  equilibrium  with  {I,  L)  all  the  time.  So  in  this  game  developing 
a  particular  systematic  misperception  to  suppress  information  will  benefit  both  players 
and  will  not  be  eliminated  by  evolutionary  forces.  Because  the  agents  are  still  playing 
according  to  the  equilibrium  of  some  coarsend  version  of  the  underlying  game,  the  out- 
come is  still  effectively  rational  with  this  limited  information,  but  does  not  correspond 
to  the  Nash  equilibrium  of  the  underlying  full-information  game.^ 

2.2     Related  Literature 

This  paper  is  related  to  a  large  body  of  research  on  evolutionary  game  theory.  \Vhether 
evolution  will  lead  to  Nash  equilibrium  is  a  central  question  of  this  literature,  and  it 
is  well-known  that  any  rest  point  of  a  monotonic  evolutionary  process  over  types  pro- 
grammed to  play  different  strategies  corresponds  to  a  Nash  equilibrium  outcome  (see 
Weibull,  1997,  or  Samuelson,  1997,  for  surveys). 

To  the  best  of  our  knowledge,  however,  there  has  been  little  work  on  meta-games 
where  evolution  occurs  over  perceptions  and  the  mapping  of  perceptions  into  actions. 
The  notable  exceptions  are  Blume  and  Easley  (1992)  and  Sandroni  (2000)  who  analyze 

^The  results  regarding  suppression  of  information  rely  on  the  assumption  that  there  is  a  perfect 
mapping  between  players'  true  perceptions  and  what  others  observe  about  the  players.  When  percep- 
tions are  not  observable,  there  will  still  be  evolutionary  forces  leading  to  effectively  rational  behavior, 
but  we  no  longer  obtain  the  suppression  of  information  as  an  evolutionarily  stable  outcome. 
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the  benefits  of  developing  accurate  perceptions  in  non-game-theoretic  situations,^"  and 
Dekel,  Ely  and  Yilankaya  (1998),  who  study  the  evolution  of  preferences  under  the 
(implicit)  assumption  of  subjective  rationality.  Dekel  et.  al.  (1998)  show  that,  when 
agents'  preferences  are  observable,  any  "stable"  outcome  will  yield  the  same  average 
payoff  as  that  of  the  "eflicient  strategy."  For  example,  in  the  Prisoners'  Dilemma  game, 
the  unique  stable  outcome  will  be  mutual  cooperation.^^ 

A  number  of  contributions,  including  Banerjee  and  Weibull  (1993),  Dekel,  Ely  and 
Yilankaya  (1998),  Ely  and  Yilankaya  (1999),  and  Ok  and  Vega-Rodondo  (2000)  show 
that  in  these  types  of  models,  when  preferences  are  not  observed,  evolution  takes  us  back 
to  Nash  equilibria.  Our  results  are  related  to  these  findings,  but  enable  us  to  analyze  the 
co-evolution  of  perceptions  and  play.  Furthermore,  they  are  not  driven  by  the  assumption 
that  preferences  are  unobserved.  In  fact,  for  the  most  part,  we  assume  that  perceptions 
are  perfectly  observed.  Instead,  our  results  follow  because  we  relax  the  "subjective 
rationality"  assumption  embedded  in  many  analyses  of  evolution  of  preferences.  As  a 
result,  evolution  favors  effectively  rational  play,  but  does  not  wipe  out  misperceptions 
or  systematic  subjective  irrationality.  Agents  can  hold  systematic  misperceptions,  and 
in  interviews  and  experimental  settings  will  display  a  variety  of  biases.  This  type  of 
behavior  would  not  be  consistent  with  models  of  evolution  of  preferences,  which  leads 
to  the  same  preferences  for  all  agents. 

We  believe  that  an  analysis  of  the  evolution  of  perceptions  and  play  is  interesting. 
in  part,  because  the  whole  discipline  of  psychology  and  the  recent  emerging  field  of 
behavioral  economics  are  explicitly  concerned  with  agents'  perceptions  and  how  these 
affect  behavior.  Moreover,  such  an  analysis  enables  us  to  relax  the  subjective  rational- 
ity assumption,  which  does  not  have  good  evolutionary  foundations  in  the  presence  of 

'°For  example,  in  Blume  and  Easley  (1992)  and  Sandroni  (2000)  agents  make  forecasts  and  saving 
decisions,  which  affect  market  prices,  but  there  is  no  further  game-theoretic  interaction.  Bhnne  and 
Easley  (1992)  compute  agents'  wealth  in  the  long-run  and  drive  a  corresponding  fitness  function.  They 
show  that,  when  the  preferences  do  not  coincide  with  this  fitness  function,  rational  agents  may  lose  their 
wealth  and  "irrational"  agents  may  determine  the  prices  in  the  long  run,  consistent  with  our  argument 
that  subjective  rationality  does  not  have  an  evolutionary  basis  when  there  are  misperceptions.  In  the 
same  vein,  in  an  overlapping-generation  model  with  risk  averse  agents,  De  Long  et  al.  (1991)  suggest 
that  agents  with  inaccurate  beliefs  about  the  future  prices  may  drive  the  agents  with  accurate  beliefs 
out  when  the  strategies  imitated  on  the  basis  of  ex-post  wealth  they  generate  rather  than  the  ex-ante 
expected  utility  level. 

'^  Similarly,  Kockesen  et  al.,  1999,  analyze  a  class  of  games  in  which  evolution  rewaxds  "negatively 
interdependent  preferences". 
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misperceptions.  Whether  subjective  irrationahty  is  imposed  or  not  has  important  impli- 
cations regarding  what  types  of  behavior  wiU  be  evolutionarily  stable.  Also,  using  this 
framework,  we  can  investigate  whether  players  have  a  common  prior,  which  is  of  central 
importance  to  the  game  theory  literature. 

Our  analysis  is  most  closely  related  to  Waldman  (1994)  and  to  recent  independent 
work  by  Samuelson  (2001).  Waldman  (1994)  analyzes  evolution  in  a  world  without 
strategic  interactions  (similar  to  the  single  agent  case  below),  but  with  two  characteristics 
and  sexual  inheritance.  He  shows  that  in  the  presence  of  sexual  inheritance,  which 
imphes  that  the  offsprings  of  successful  agents  are  not  their  perfect  replicas,  evolution 
may  pick  characteristics  that  are  "best  responses"  to  each  other,  but  may  not  be  the 
global  maximrmi.  Using  this  framework,  he  illustrates  how  agents  may  be  stuck  with 
utility  functions  that  do  not  maximize  their  fitness,  and  compensate  for  this  by  making 
other  systematic  mistakes.  Samelson  (2001)  is  closely  related  to  Waldman  (1994),  as 
he  illustrates  that  in  the  presence  of  deviations  from  Bayes'  rule,  an  agent  can  increase , 
his  evolutionary  fitness  by  distorting  his  utility  function  (e.g.,  by  including  the  others' 
consumption  levels  to  his  own  utility  function) .  It  is  worthwhile  to  note  that  our  notion 
of  "effective  rationality  despite  misperceptions"  is  somewhat  different  from  Samuelson's 
and  Waldman's  notions.  To  see  this,  consider  an  agent  who  maximizes  his  fitness  by 
distorting  his  utility  function.  According  to  our  terminology,  this  agent  has  "accurate 
perceptions,"  since  he  will  identify  his  preferred  actions  with  those  that  maximize  his 
fitness.  In  contrast,  agents  with  inaccurate  perceptions  in  our  setup  will  report  to  prefer 
actions  that  do  not  maximize  their  fitness,  but  nevertheless  end  up  taking  actions  that 
are  good  for  their  long-run  fitness. 

Finally,  our  results  are  also  related  to  analyses  of  evolutionary  dynamics  with  pre- 

play  communication  (e.g.,  Warneyrd,  1993,  Kim  and  Sobel,  1995,  Banerjee  and  Weibull, 

2000)  because  perceptions  are  similar  to  signals  sent  by  players  before  the  game  is 

played.  In  these  models,  when  observed  signals  do  not  restrict  agents'  characteristics  or 

behavior,  neutral  stability  leads  to  Nash  equilibrium  outcomes  of  the  imderlying  game 

(see  Banerjee  and  Weibull,  2000).^^  Nevertheless,  perceptions  are  different  from  cheap 

talk  because  they  also  encapsulate  what  players  know  about  the  world,  and  whether 

When  observed  signals  restrict  agents'  behavior,  evolution  may  lead  to  non-Nash  outcomes,  as  in 
Robson  (1990),  and  to  strictly  dominated  strategies  as  in  Banerjee  and  Weibull  (1993). 
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they  can  distinguish  between  different  worlds.  In  particular,  when  the  underlying  game 
consists  of  many  different  worlds  or  subgames — as  we  have  here,  a  player's  perceptions 
will  restrict  his  possible  plays  by  confining  him  to  play  the  same  (mixed)  sub-strategy 
at  any  two  worlds  that  are  perceived  to  be  the  same.  This  is  important  in  some  of  our 
results,  in  particular,  for  the  result  that  jointly  beneficial  suppression  of  information  may 
survive  evolutionary  pressure. 

3     The  Single-agent  case 

We  start  by  analyzing  the  evolution  of  perceptions  and  play  in  a  single-agent  decision 
problem — i.e.,  in  a  setting  without  strategic  interactions.  This  analysis  will  introduce 
some  of  the  key  concepts  and  illustrate  the  reasoning  we  use  in  the  multi-agent  case. 

Throughout  the  paper  we  consider  a  large  set  of  agent  types  subject  to  evolution- 
ary selection,  and  assume  that  there  is  a  process  of  replicator  dynamics  that  relates 
the  frequency  of  different  agent  types  to  a  monotonic  fimction  of  their  fitness.  We  are 
interested  in  the  stable  outcomes  of  the  replicator  dynamics.  Like  many  other  evolu- 
tionary analyses,  we  limit  ourselves  to  Neutrally  Stable  Strategies,  which  correspond 
to  Lyapunov  stable  points  of  (strictly)  monotonic  replicator  dynamics  (see  Bomze  and 
Weibull,  1995,  or  Weibull,  1997). 

Assume  a  world  where  we  have  a  set  C  of  consequences  and  a  finite  set  S  of  strategies 
(actions)  s  :  VL  ^^  C  with  uncertain  consequences  for  some  state-space  O.  The  set  of 
possible  "worlds"  is  assumed  to  be  finite  and  denoted  by  W  with  a  generic  member  w. 
Intuitively,  different  worlds  correspond  to  different  types  of  decision  problems  that  the 
individual  is  faced  with,  and  the  same  action  may  have  different  payoff  consequences  in 
different  worlds.  We  denote  the  probability  distribution  over  W  by  Q,  with  Q  {w)  >  0 
for  each  w.  The  expectation  with  respect  to  Q  will  be  denoted  by  Eq.  For  each  w  G  W, 
we  consider  a  pair  (u,^,^^,)  of  a  utility  function  Uu,  :  C  ^  R  on  the  consequences,  and 
a  probability  distribution  g^,  on  fl.  The  pair  (uu,,g,o)  represents  the  base  preferences  on 
actions,  or  the  agent's  evolutionary  fitness.  From  a  strict  evolutionary  point  of  view, 
g^,  measures  the  frequency  of  the  events  and  u^^  (c)  is  the  evolutionary  fitness  following 
consequence  c.  We  summarize  the  base  preferences  of  the  agent  by 

V  (s;  w)  =  Eg^,  {u^  o  s) 
13 


at  each  s  E  S  and  w  G  W,  where  Eg  denotes  the  expectation  with  respect  to  q.  This 
notation  will  be  used  extensively  in  the  next  section. 

Analogously  to  the  base  preferences  of  the  agent,  for  any  probability  distribution  q 
and  preferences  u,  we  define  the  payoff  function 

V  (s)  =  Eg  {u  o  s) 

This  function  gives  the  ("perceived")  payoff  of  choosing  action  s  given  preferences  u 
and  probability  distribution  q.  We  consider  a  finite  set  V  C  M'^  of  payoff  functions 
V  :  S  —^  M.  We  define  a  perception  as  a  function 


TT 


:W  -^V. 


Intuitively,  a  perception  tt  {lu)  is  an  interpretation  of  a  world  w  as  a  (possibly  different) 
world  w'.  The  set  of  actual  worlds  that  can  be  realized  is  naturally  contained  in  the  set 
of  possible  worlds  that  the  agent  can  perceive  (i.e.,  V  D  {v  (■,  u;)  ;  tu  G  W}). 

We  call  a  perception  n*  with  tt*  (w)  =  v{-,w)  at  each  w  as  accurate.  Intuitively, 
an  accurate  perception  enables  the  decision-maker  to  correctly  recognize  the  world  he 
is  confronted  with.  Given  any  n  and  uj,  we  say  that  there  is  a  misperception  at  w  if 
IT  (w)  7^  v{-,w).  This  corresponds  to  either  to  the  perceived  probability  distribution 
differing  from  the  underlying  frequencies,  i.e.,  g  ^  g^,,  or  the  perceived  utility  function 
differing  from  fitness  function,  i.e.,  u  ^  Uyj.^^ 

Every  time  a  world  w  is  drawn,  the  agent  perceives  the  world  as  tt  [w]  and  takes  an 
action  s  E  S,  determining  his  fitness.  Any  function  a  :  V  —^  S  is  called  a  play,  and  the 
set  of  all  plays  is  denoted  by  E  =  5^. 

Definition  1  A  play  a  is  said  to  be  subjectively  rational  if  and  only  if 

a  {v)  G  arg  max  v  (s) . 

A  subjectively  rational  play  maximizes  the  agent's  perceived  payoff.  This  is  the 
standard  definition  of  rationahty  in  Game  Theory.  For  many,  this  is  a  tautology,  for  the 


'■^Accurate  perceptions  may  also  correspond  to  a  pair  {u,q)  ^  {uw,<lw)  of  inaccurate  probability 
distribution  and  an  inaccurate  utility  function,  such  that  Eq^,  {uu,  o  s)  =  Eq(uos).  For  an  example, 
see  Samuelson  (2001). 
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perceptions  (or  beliefs)  are  merely  expressions  of  the  choice,  derived  from  the  choice  f\inc- 
tion  using  the  principle  of  revealed  preferences.  In  this  paper,  we  take  the  perceptions 
and  choice  independent.  We  will  take  cr*  as  a  generic  subjectively  rational  play. 

Definition  2  A  pair  (tt,  a)  of  a  perception  and  play  is  said  to  be  effectively  rational 
(with  full  information)  if  and  only  if  a  o  n  =  a*  o  n*  for  some  subjectively  rational  a* 
and  accurate  rr* ,  i.e., 


a  (tt  {w))  G  arg  max  v  (s;  w) 
ses 


at  each  w  G  W. 


That  is,  a  pair  (tt,  a)  of  a  perception  and  a  play  is  effectively  rational  if  we  can  never 
distinguish  the  resulting  choice  from  that  of  an  individual  who  perceives  accurately  and 
behaves  rationally.  Here,  we  use  a  strong  notion  of  accurate  perception;  it  implies  that 
the  agent  distinguishes  each  state  from  the  others  (hence  our  qualification,  with  full 
information).  In  our  next  section,  we  will  also  define  effective  rationality  with  limited 
information,  which  only  requires  an  agent  to  maximize  his  expected  fitness,  given  the 
information  available.  Our  next  example  illustrates  that  there  will  be  many  combinations 
of  misperception  and  subjectively  irrational  play  that  yield  a  rational  choice. 

Exajnple  1  Consider  the  case  W  —  {w},  S  =  {a,b}  with  v  (a)  =  1  and  v  (b)  =  0. 
Suppose  also  that  V  =  {vi,V2}  such  that  vi  (a)  =  1  and  v^  (6)  =  0,  and  V2  (a)  =  0  and 
V2  (6)  =  1.  Therefore,  Vi  corresponds  to  accurate  perception,  while  V2  is  a  misperception. 
Then,  there  are  four  combinations  of  perceptions  and  play: 

1.  agent  perceives  accurately  plays  subjectively  rationally,  and  chooses  a; 

2.  he  inaccurately  perceives  b  better  than  a,  and  plays  subjectively  rationall}-,  choos- 
ing b; 

3.  he  accurately  perceives  a  better  than  b,  but  plays  subjectively  irrationally,  and 
chooses  b; 

4.  he  inaccurately  perceives  b  better  than  a,  but  plays  subjectively  irrationally,  aiad 
chooses  a. 
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Combinations  1  and  4  are  effectively  rational  with  full  information,  and  yield  higher 
fitness  than  that  of  the  other  two  perception-play  combinations.  With  any  monotonic 
rephcator  dynamics,  effectively  rational  agents  will  survive,  while  the  rest  will  die  away. 
Some  of  these  effectively  rational  players  will  hold  misperceptions,  they  will  make  clearly 
irrational  choices  given  their  perceptions,  but  the  overall  choice  will  maximize  their 
fitness. 

More  importantly,  the  concept  of  subjective  rationality  loses  its  appeal  in  the  presence 
of  misperceptions:  the  subjectively  irrational  agents  in  4  do  better  than  the  subjectively 
rational  agents  in  2.  Therefore,  in  a  world  where  the  agents  may  have  misperceptions, 
subjective  rationality  does  not  necessarily  have  an  evolutionary  foundation. 

Now,  given  a  play  cr,  we  can  define  effectively  accurate  perceptions,  which  are  not 
necessarily  accurate. 

Definition  3   Given  any  -play  a,  a  perception  tt  is  said  to  be  (effectively)  a-accurate  if 
and  only  if  a  o  ir  =  a  o  tt*  . 

With  this  terminology,  (tt,  a)  is  effectively  rational  if  and  only  if  n  is  cr-accurate. 
Given  any  a,  we  define  the  one-person  meta-game 

where 

g  (tt:  a)  =  Eq  [v  {a  (tt  {w))  ;  w)]  =  Eq  [E^^,  [u^  {a  (tt  {w)))]] 

at  each  tt  e  11.  Note  that  g{TT]a)  is  the  agent's  expected  payoff  with  respect  to  the 
base  preferences,  given  that,  at  each  w,  the  agent's  perceptions  will  be  n  {w)  and  he 
will  play  a  and  choose  a  (tt  («'))•  ^^  game  GV,  cr  is  fixed  but  the  perceptions  can  vary. 
Even  though  G^j  is  technically  a  game  with  a  strategy  space  11,  it  is  not  a  usual  game 
since  the  perceptions  tt  G  11  are  not  chosen;  the  agent  happens  to  have  them,  perhaps 
because  they  inherited  them  or  as  a  result  of  mutations.  For  this  reason,  we  refer  to  it 
as  a  meta-game.  When  g  (tt;  a)  <  g  (tt';  a),  given  that  he  will  choose  according  to  a,  his 
evolutionary  fitness  will  increase  if  his  perceptions  happen  to  change  from  tt  to  tt'. 
We  also  consider  the  meta-game 

G  =  (nxE,p) 
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where  g  (tt;  a)  is  defined  as  above.   This  game  depicts  a  situation  where  both  the  per- 
ceptions and  the  play  evolve  together. 

Definition  4  Given  any  play  a,  a  perception  tt  G  11  ?s  Neutrally  Stable  (NS)  for  G^  iff 
g{-n\a)  >  g{7r';a)  at  each  tt'  G  FI.  Likewise,  (tt,  cr)  G  IT  x  E  is  NS  for  G  iff  giTT^a)  > 
g{-K';a')  at  each  (7r',(T')  G  IT  x  E. 

This  definition  simply  states  that  a  perception,  or  a  combination  of  perception  and 
play,  will  be  Neutrally  Stable  if  no  other  alternative  gives  higher  fitness.  Therefore,  we 
require  that  there  are  no  forces  taking  the  system  away  from  this  configuration.  This  is  a 
simpler  version  of  the  definition  of  Neutral  Stability  we  give  below  for  meta-games  with 
strategic  interactions.  Our  interest  in  Neutrally  Stable  outcomes  is  motivated  by  the 
fact  that  Lyapunov  stable  outcomes  with  respect  to  any  monotonic  replicator  dynamics 
will  be  Neutrally  Stable  (see,  e.g.,  Weibull,  1997). 

Proposition  1   Given  any  subjectively  rational  cr* ,  any  perception  tt  ^Yl  is  NS  for  G^. 
iff  it  IS  a* -accurate. 

The  proof  of  Proposition  1  is  combined  with  that  of  Proposition  2.  Proposition  1 
states  that  under  subjective  rationality,  the  siuviving  agents  will  hold  accrurate  percep- 
tions (and  will  choose  effectively  rational  behavior).  This  result  implies  that  the  example 
of  misperceptions  we  found  in  Game  1  of  Section  2  is  due  to  strategic  interactions  and 
cannot  happen  in  the  single-agent  case.  This  is  intuitive,  since  as  noted  in  Section  2, 
misperceptions  are  useful  because  they  provide  commitment,  and  in  this  single- agent 
meta-game,  commitment  has  no  value. 

Next,  we  see  that  when  the  requirement  of  subjective  rationality  is  relaxed,  there 
will  be  some  other  effectively  rational  agents  who,  despite  their  inisperceptions,  do  as 
well  as  these  agents. 

Proposition  2  Any  (tt,  cr)  G  11  x  E  is  Neutrally  Stable  for  G  iff  it  is  effectively  rational 
with  full  information. 

Proof.  Take  any  effectively  rational  (vr,^)  and  any  (7r,cr)  G  11  x  E.  At  each  w  G  W, 
by  definition,  we  have  v  {a  {tt  {w))  ]  w)  —  niax^gs  u  (a;  ly)  >  v  {a  {tt  {w)) ;  w) .  Hence, 
g  (fr,  a)  =  Eq  [v  {a  {tt  (u;))  ;  w)]  >  Eq  [v  {a  {tt  {iv)) ;  iv)]  =  g  (tt,  a),  showing  that  (tt,  a)  is 
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Neutrally  Stable  for  G.  Conversely,  if  (tt,  a)  is  not  effectively  rational,  then  the  former 
inequality  will  be  strict  at  some  Wq  with  Q  {ivq)  >  0,  rendering  the  latter  inequahty 
strict,  and  showing  that  (7r,cr)  is  not  Neutrally  Stable  for  G. 

To  prove  Proposition  1,  observe  that  (tt,  a*)  is  effectively  rational  iff  tt  is  (T*-accurate. 
Hence,  taking  a  =  a  =  a*  above,  we  obtain  Proposition  1.    ■ 

4     The  Multi-person  Case 

In  this  section  we  analyze  evolution  when  there  are  many  players  strategically  interacting 
with  each  other. ^"^  In  that  case,  when  the  perceptions  are  observable,  under  subjective 
rationahty,  a  player  may  gain  from  his  misperception,  a  fact  that  we  established  in 
Section  2  using  Game  1  as  an  example.  Nevertheless,  we  have  also  seen  that  in  the 
presence  of  misperceptions,  subjective  rationality  is  no  longer  supported  by  evolution. 
Therefore,  here  we  focus  on  a  process  in  which  both  perceptions  and  play  evolve  together. 

4.1      The  environment 

We  consider  a  set  A'^  =  {1,  2, . . . ,  n}  of  players  and  a  set  5*  =  YlieN  "^^  of  strategy  profiles. 
For  each  lu  G  W,  we  consider  an  underlying  game  {N,  S,v  {-^w)),  where  v{s,w)  — 
{v^  (s,  w)  ,v'^  {s,w) , . . . ,  {;"  (s,  w))  G  R"  is  the  players'  base  payoiT  vector  at  a  strategy 
profile  s  E  S.  Here  i)^  (s,  lu)  measures  the  evolutionary  fitness  of  a  player  t  E  N  when 
the  strategy  profile  s  is  played  at  lu.  The  set  of  all  feasible  payoff  functions  is  taken  to 
be  a  finite  set  V  C  (R")    that  contains 
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for  each  non-empty  W  C  W. 

By  a  perception,  we  mean  any  function  from  it  :  VV  ^  V.  The  set  of  all  perceptions 
is,  once  again,  denoted  by  H  =  V^^\  By  a  perception  profile,  we  mean  any  vector 
TT  =  (7ri,7r2, . . .  ,7r„)  G  H'^  of  perceptions.  We  write  tt^  {s,w)  for  the  payoff  of  player 
j  as  perceived  by  player  i  when  strategy  profile  s  is  played  at  world  w.  By  a  mixed 
perception  we  mean  any  probability  distribution  x  on  the  set  of  perceptions.  Here,  x  [n) 
can  be  considered  as  the  proportion  of  population  who  has  perception  n. 

'■"This  corresponds  to  frequency-dependent  evolution  in  the  terminology  of  Maynard-Smith  (1982). 
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We  focus  on  games  where  the  realization  of  the  payofT  perceived  by  each  player, 
TT,  {-.w)  for  all  i,  is  observable.  That  is,  all  agents  see  the  payoff  matrix  as  perceived  by 
other  agents.  This  is  motivated  by  the  discussion  in  the  introduction  where  we  pointed 
out  that  nonrational  behavior  is  most  likely  to  arise  as  an  evolutionary  stable  outcome 
when  perceptions/preferences  are  observed.  With  this  assumption,  a  player's  strategy 
will  be  a  function  that  maps  from  the  perception  profiles  into  strategies:  cr^  :  V^  — > 
A  (Si),  where  A  (S^)  denotes  the  space  of  the  probability  distributions  on  5j.  Notice  that 
any  such  strategy  profile  a  :  V^  — +  A  (5)  can  also  be  thought  as  a  solution  concept,  since 
it  determines  how  the  game  will  be  played  given  the  perceptions.  It  is  also  important 
that  it  is  the  realizations  of  the  perceptions,  tt,  (-jw),  not  the  perception  function,  tt,, 
itself  that  is  observable. ^^ 

We  consider  a  process  in  which  at  each  date  each  player  is  randomly  allocated  to  play 
role  z  in  a  randomly  selected  imderlying  game  (A'^,  S,v  {■■,iu)).  Each  agent  with  perception 
TT  perceives  the  payoff  function  as  tt  {w).  We  then  have  a  profile  (tti  (w)  ,  •  •  •  ,  7r„  {w)) 
of  perceived  payoff  functions  where  tt,  (w)  denotes  the  payoff  function  perceived  by  the 
player  who  plays  role  i.  Observing  his  role  i  and  the  profile  (tti  (w)  ,  •  •  •  ,7r„  {w))  €  V" 
of  perceived  payoff  functions,  a  selected  player  plays  some  (possibly  mixed)  strategy' 
CTj  (tti  {w)  ,  •  •  •  ,  7r„  {iv})  G  A  {Si)  in  the  underlying  game  {N,  S,  v  (•;  w)).  Given  any  v  G 
V"  and  any  s,  G  S",,  a^  (5j,w)  denotes  the  probabihty  of  playing  s,  according  to  at  [v)  G 
A  (5",).  Since  a  already  depends  on  the  player's  own  perceptions,  a  inixed  strategy  can 
be  defined  simply  as  a  pair  (x,a)  of  a  mixed  perception  and  a  mixed  play.  Finally, 
given  any  two  strategies  (x,  a)  and  (y,  //),  the  mixture  A(x,  a)  +  (1  —  A)  (y,  /j,)  =  (Ax  + 
(1  —  A)  y,  Aa  +  (1  —  A)  /_i)  for  any  A  G  [0, 1]  is  also  a  strategy- 

To  formalize  the  evolution  of  perception  and  play,  we  consider  the  meta-game  G 

where  a  mixed  strategy  of  a  player  is  a  pair  {x,a)  of  a  perception  x  G  A(V'"  )  and  a 

play  a  :  V^  -^  A  [S)  determining  which  strategy  a^  {v)  he  will  play  at  each  role  i  and  at 

each  profile  v  =  (tti  [w)  ,  •  •  •  ,  7r„  [w))  of  perceived  payoff  functions.  The  expected  payoff 

of  a  player  with  (x,  a)  in  a  population  with  aggregate  distribution  (y,  ji)  of  perception 

^""If  the  perception  function  were  observed,  players  would  know  not  only  how  an  agent  perceives  the 
current  world,  but  also  how  he  would  have  perceived  each  lu  £  W .  For  our  qualitative  results,  it  is  not 
important  whether  the  entire  perception  function  or  only  the  realization  of  this  function  is  observable 
(so  long  as  a  player  does  not  observe  more  about  his  perception  than  the  others  do — such  "informational 
asymmetry''  may  allow  agents  to  always  distinguish  between  all  ft'  €  U',  see  Section  5). 
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and  play  is 

U{{x,a),{y,fj,))  =  -^^^Yl  v'  is,w)  Q  {w)  a,{s„TT  {w))Y[ii{sj,n  (iv))  X  {TT,)Yly  (tTj) 

where  s  =  (si, . . . ,  5„)  and  tt  —  (tti,  . . . ,  7r„).  This  equation  states  the  following.  The 
agent  will  be  selected  to  play  a  role  i  (with  probability  1/n)  in  a  randomly  drawn 
underlying  game  {N,S,v  {■■]w))  (with  probability  Q{w)).  He  will  have  perception  ttj 
with  probabiUty  x(7r,),  while  every  other  player  j  will  have  some  perception  tTj  with 
probability  y  (tTj).  Observing  the  profile  tt  (w)  =  (tti  {w)  , . . . ,  7r„  (w)),  he  will  play  some 
Sj  G  Si  with  probability  a,  (sj,  tt  (w)),  while  every  other  player  j  will  play  some  strategy 
Sj  with  probability  ^  (sj,  tt  (w))-  The  agent's  true  contingent  payoff  for  such  a  play  will 
be  ii''  (s,  w). 

Definition  5  A  pair  (x,  a)  is  said  to  be  Neutrally  Stable  for  G  iff,  for  each  [y,  ji),  there 
exists  some  e  >  0  such  that 

U  {{x,  a),  e{y,  fi)  +  {1  -  e)  {x,a))  >  U  ((y,  //),  e(y,  i^)  +  {1  -  e)  [x,  a)) 

for  each  e  E  (0, e). 

As  noted  above,  neutral  stability  corresponds  to  the  Lyapunov  stability  in  replicator 
dynamics.  It  is  also  clear  that  when  (.r,  cr)  is  neutrally  stable,  {{x,a)  ,{x,a))  is  a  Nash 
equilibrium  in  the  meta-game  G,  i.e.,  U  ((.r,  a),  {x,  a))  >  U  {{y,  /j.),  (.t,  a))  for  each  (y,  ;x) 
(see  for  example  Weibull,  1997). 

As  in  the  single-agent  case,  we  can  define  a  meta-game  to  describe  the  evolution  of 
perceptions  for  a  fixed  solution  concept  and  the  evolution  of  the  solution  concept  (or 
play)  given  a  fixed  set  of  perceptions.  The  discussion  of  Game  2  in  Section  2  illustrates 
that  when  evolution  is  only  limited  to  perceptions,  misperceptions  can  arise  as  (neu- 
trally) stable  outcomes  of  evolutionary  processes.  Here  we  discuss  the  evolution  of  both 
perceptions  and  play.  To  this  end,  we  first  analyze  evolution  of  play  under  a  common 
(and  fixed)  perception,  and  then  show  that  any  combination  of  uncommon  perceptions 
actually  embed  a  common  perception.  Using  this  insight,  we  then  provide  a  general 
characterization  of  the  simultaneous  evolution  of  perceptions  and  play. 
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4.2      Evolution  of  play  under  a  common  perception 

Consider  the  case  where  all  agents  are  restricted  to  have  a  common  perception  of  the 
payoff  function,  while  how  they  play  the  game  (given  the  perception)  evolves  according 
to  the  process  we  defined  above.  In  this  case,  the  common  prior  (perception)  assumption 
holds  by  construction. 

Consider  a  meta-game  G^-  where  all  agents  share  a  fixed  pure  perception,  tt,  and 
where  the  strategy  of  an  agent  is  some  a'^  :  V  —^  A{S)  determining  which  strategy  a  J  {v) 
he  will  play  at  each  role  i  when  the  commonly  perceived  payoff  function  is  v.  Define 
7f  =  (tt,  TT, . . . ,  tt)  as  the  perception  profile  that  assigns  the  common  perception  n  to  all 
players.  Let  also  a  :  V^  -^  A  (S)  be  such  that  a'"  {.,n  [w))  =  cr  (.,  tt  [w)),  and  similarly, 
let  II  :  V^  ^  A{S)  and  ii""  :  V  -^  A{S)  be  such  that  /x^  (.,  tt  (u;))  =  /x(.,7r(w)). 
That  is,  a  and  ^  induce  the  same  behavior  as  a'^  and  ii^  when  all  players  have  common 
perception.  Then,  we  can  define  expected  (true)  fitness  in  the  meta-game  U^  as: 

t/M'7^/i^)  =  f/((7f,a),(7f,/.))  =  ^^5;]5^^;'(5,«;)Q(7i;)a,(5,,7^(u;))J];^^-(s,,7^(u-)), 

ieN  wew  ses  a^x 

where  tt  =  (tt,  tt,  .  .  .  ,  tt).  Definition  5  immediately  implies  that  a  play  a  is  Neutrally  Sta- 
ble for  Gt,  iff,  for  each  /^i,  there  exists  some  e  >  0  such  that  U  ((tt,  a) ,  e  (tt,  /i)  +  (1  —  e)  (tt,  a))  > 
U  ((tt ,  /x) ,  e  (tt ,  n)  +  [1  —  e)  (tt,  a)),  or  alternatively  such  that  V^  (a,  e/i .  +  (1  —  e)  cr)  > 
V  (^,  e^  +  (1  -  e)  a)  for  each  e  G  (0,  e). 

The  information  incorporated  in  a  common  perception  tt  is  represented  by  the  infor- 
mation partition 

r  ^  {tt"^  {tt  {w))  \w  e  W}. 

The  cell  that  contains  a  given  w  is  denoted  by  /"  {w)  —  tt"^  {n{w)).  No  lu'  G  I'^  {w) 
can  be  distinguished  from  w  using  the  knowledge  of  tt.  Knowing  the  function  tt  and  the 
realization  n{w),  a  Bayesian  can  infer  that  the  true  world  is  in  /'^  (w)  =  vr"^  {tt{w)), 
but  when  /'^  {w)  is  not  a  singleton,  he  cannot  distinguish  between  different  worlds  con- 
tained in  P  (lu).  Therefore,  the  expected  true  fitness  given  the  realization  n  {iv)  and  the 
function  tt  is 

EQ[v\r{w)]^- ^-—    Yl    Hv^nQi^o). 
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In  the  case  where  tt  is  an  invertible  function,  tt~^  {n  {w))  =  iv,  and  Eq[v\P  {w)]  = 
v{-;w).  If  Bayesian  players  knew  only  the  perception  function  it  and  the  perceived 
payoffs  7r(tf;),  using  this  coarse  information  they  could  compute  expected  payoffs  as 

EQ[v\r{w)]. 

Definition  6  Given  any  (tTjct),  play  a  is  said  to  be  effectively  rational  with  respect  to 

I""  ^ff 

a  G  arg  max  V"  Eq  [u'  (5)  \r  {w)]  a'  (s^)  /i~'  (s_j) 
f'     — i, 

for  some  belief  iJ,_j^  about  the  other  players'  play  s_j  for  each  i  E  N  and  w  G  W .  A  play 
a  is  said  to  be  effectively  rational  with  limited  information  iff  it  is  effectively  rational 
with  respect  to  some  I^ .  A  play  a  is  said  to  be  effectively  rational  with  full  information 
iff  it  is  effectively  rational  with  respect  to  /"'  where  tt*  [w)  —  w  at  each  w. 

Because  a  maximizes  the  agent's  utility  given  some  belief,  ^-_j,  regarding  other  play-, 
ers'  behavior,  it  also  rules  out  the  play  of  dominated  strategies  given  the  information  set 
/'".  In  this  sense.  Definition  6  generahzes  Definition  2  from  previous  section  to  the  case 
with  strategic  interactions.  When  the  payoff  to  each  agent  is  independent  of  the  oth- 
ers' actions,  the  definition  of  effectively  rational  behavior  with  full  information  coincides 
with  that  in  Definition  2. 

Next,  given  the  information  set  P  (w),  one  can  construct  a  new  game 
{N,S,Eq  [v\I'"  (w)])  with  a  set  of  players  A'',  strategy  spaces  S,  and  payoff  functions 
Eq  [v\P  (w)],  from  the  underlying  game  (A'',  S,  v).  We  refer  to  this  as  a  coarsened  game, 
since  the  payoff  functions  are  the  averages  of  the  true  payoff  functions  over  an  arbitrary 
information  set  /",  which  may  not  distinguish  all  underlying  worlds  from  each  other. 
We  now  establish  that  there  is  a  close  link  between  the  Nash  ecjuilibria  of  this  derived 
game  and  the  neutrally  stable  outcomes  of  G„. 

Proposition  3  Given  any  n  :  W  -^  V  and  any  a  :  V  ^>-  A(S'),  if  a  is  neutrally 
stable  for  G„,  then  a  (tt  («;))  is  a  Nash  equilibrium  of  game  {N,  S,  Eq  [v\I'"  [w)])  for  each 
w  E  W .   Therefore,  b  otx  is  effectively  rational  with  limited  information. 

Proof.  We  will  prove  the  contrapositive  of  the  first  part.  Let  a  :  V  —^  A  (S)  be  such 
that  a  (tt  (u;))  is  not  a  Nash  equilibrium  of  {N,  S,  Eq  [v\I"  {w)])  for  some  w  eW.  Then, 
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there  exists  some  k  e  N  and  ak  (tt  {w))  G  A  (5'fc)  such  that 
INC  =  J2^Q  [^'  (^'  ^"^  1^"  (^)]  ^'^  (^^^  "^  ("'))  n  '^^  ^^J'  "^  ^"^^^ 

-  J]  E-Q  [v''  (S,  W)  |/^  (W)]  afc  (Sfc,  TT  (W))  J]^  <Jj  (Sj,  TT  (w))  >  0. 

Now,  consider  a  :  F  ^  A  (5)  with 

_    ,  \  _  /  ^fc  (^  ('"^))    if  ^  =  ^  and  v  —  n  {w) , 
y  ^^^{v)  otherwise. 

Using  the  definitions,  we  compute  that 

[/-(a,<j)  =  f/'^(a,a)  +  /iVC--     y     Q(u/)  >?7"(a,a), 

n     ^—^ 

showing  that  {a,  a)  is  not  a  Nash  equihbrium  of  the  meta  game  G-^.  But  {a,  a)  is  a  Nash 
equihbrium  of  G^  whenever  a  is  neutrally  stable  for  G^.  Therefore,  a  is  not  neutrally 
stable  for  G^.  That  aoir  is  effectively  rational  with  limited  information  then  immediate^ 
follows  from  the  fact  that  no  dominated  strategy  is  played  in  a  Nash  equilibrium.    ■ 

This  proposition  implies  that  only  Nash  equilibrium  strategies  in  the  "coarsened" 
game  (A'',  S,  Eq  [vlP  («')])  are  candidate  neutrally  stable  outcomes  of  the  common  per- 
ception meta-game,  G.„.  The  intuition  for  why  Nash  equilibria  of  the  underlying  game 
can  be  neutrally  stable  is  straightforward:  suppose  all  players  share  the  same  accurate 
perception.  Then,  standard  arguments  from  evolutionary  game  theory  imply  that  Nash 
equilibria  of  the  underlying  game  will  be  stable  outcomes  of  the  meta-game  G^,  and 
therefore,  evolution  leads  to  behavior  that  is  effectively  rational  with  limited  informa- 
tion. The  same  reasoning  immediately  generalizes  to  the  case  where  tt  is  not  accurate: 
even  if  players  have  misperceptions,  what  matters  is  their  fitness,  determined  b}-  payoff's 
in  the  underlying  game,  v,  and  the  information  incorporated  in  these  perceptions.  Fi- 
nally, to  see  why  Nash  equilibria  of  the  "coarsened"  game  {N,S,Eq  [v\I'^  (w)])  can  be 
neutrally  stable,  recall  the  example  in  the  introduction  where  suppression  of  information 
was  mutually  beneficial.  When  all  players  have  the  same  perception  not  enabling  them 
to  distinguish  between  different  worlds,  Nash  equilibria  of  this  coarsened  game  can  also 
be  neutrally  stable.  We  will  provide  a  more  detailed  analysis  of  tliis  in  Example  2  below. 

Motivated  by  the  idea  that  agents  can  adjust  their  behavior  much  faster  than  their 
preferences,  Dekel,  Ely  and  Yilankaya  (1998)  assume  that  agents  will  always  play  a 
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Nash  equilibrium  of  the  game  where  the  payoffs  are  determined  by  their  preferences, 
even  though  these  preferences  may  not  reflect  their  true  fitness.  Our  proposition,  in 
contrast,  can  be  interpreted  as  showing  that  when  players  adjust  their  behavior  faster 
than  their  preferences  (in  the  sense  that  their  behavior  can  change  while  their  preferences 
are  fixed),  the  play  will  approach  a  Nash  equilibrium  of  the  underlying  game,  where  the 
payoffs  are  players'  true  fitness,  not  their  perceived  payoffs. 

We  next  extend  Proposition  3  to  the  case  of  a  mixed  common  perception  (rather  than 
a  common  pure  perception).  A  nuxed  common  perception  is  a  probability  distribution 
X  G  A(n)  as  in  G.  Unlike  in  G,  however,  we  require  the  agents  to  share  the  same 
perception  tt  at  each  realization  tt.  Thus,  by  construction,  players  always  share  a  common 
perception.  Now  construct  the  meta-game  C^;  with  payoff  function 

f/'(CT,M)    =    yU'(a.,j.)x(T,) 


Tren 


■K&n  i^N  w^W  s&S  ]+i 

where  7f  =  (tt,  . . .  ,7r).  Naturally,  a  play  o  is  Neutrally  Stable  for  G^  iff  for  each  /7., 
there  exists  some  e  >  0  such  that  U^  (a,  e/i.  +  (1  —  e)a)  >  U^  (^t,  e//  +  (1  —  e)  a)  for  each 
ee(0,e). 

Take  any  commonly  perceived  payoff  function  v  —  i^  {lu)  for  some  w  G  W  and  vr  with 
x{tt)  >  0;  the  profile  of  perceived  payoffs  is  {v,v, . . .  ,v).  Denote  the  set  of  perception 
functions  that  can  lead  to  perceived  payoff  v  by  11^,  =  {ttItt  {w)  =  v  for  some  w  G  W}. 
Observing  v  and  knowing  the  distribution  x  on  perceptions,  a  Bayesian  can  update  his 
beliefs  about  the  perception  functions  and  compute  the  expected  payoff  functions  given 
this  coarse  information.  Given  any  tt  G  Fl,,,  the  posterior  probability  that  the  common 
perception  function  is  tt  is 

^    ,    ,        N  2:  (tt)  X  (tt) 

When  TT  ^  rij,,  we  have  Pr(7r|;r,^;)  =  0.  Now,  given  any  common  perception  tt,  and 
the  commonly  perceived  payoff  v  —  ■k  (lu),  the  expected  true  payoff  (as  perceived  by  a 
Bayesian)  will  be  Eq  \v\n~^  (v)]  =  Eq  [v\P  (w)]  .  Then,  the  expected  payoff  function 
given  V  =  -k  {iv)  and  x  will  be 

E[v\x,v]^    Y^   Fr{Tr\x,v)EQ[v\TT-^{v)]  =    J^  ^r  {tt\x,w  {w))  Eq  [v\r  {tu)]  . 
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Effective  rationality  mth  respect  to  x  is  defined  as  in  Definition  6,  vising  E  [v\x,v\  as  the 
payoff  function. 

For  any  perceived  payoffs  are  f  =  tt  (w),  we  can  now  construct  another  derived  game 
{N,S,E\v\x,v\).  Once  again,  we  are  interested  in  this  game  because  there  is  a  close 
fink  between  the  Nash  equihbria  of  this  derived  game  and  the  neutrally  stable  outcomes 
resulting  from  evolution  over  play. 

Proposition  4  Given  any  mixed  common  perception  x,  and  any  n  with  x(7r)  >  0,  if 
(J  :  y  — >  A  {S)  is  neutrally  stable  for  G^,  then  a  (tt  {w))  is  a  Nash  equilibrium  of  game 
{N,  S,  E  [v\x,  n  (w)])  for  each  w  E  W .  Therefore,  a  on  is  effectively  rational  with  limited 
information. 

This  proposition  extends  Proposition  3  to  the  case  of  mixed  common  perceptions.  It 
is  useful  because  when  we  consider  evolution  over  perceptions  and  play,  we  will  encounter 
mixed  perceptions.  We  omit  the  proof  of  this  proposition  which  is  similar  to  that  of 
Proposition  3. 

4.3      Co-evolution  of  perceptions  and  play 

Now  we  consider  the  meta  game  G,  where  the  perceptions  and  play  evolve  together, 
and  show  that  at  any  neutrally  stable  outcome  the  agents  play  a  Nash  equilibrium 
of  a  coarsened  version  of  the  underlying  game.  Towards  this  goal,  we  first  derive  the 
common  mixed  perception  x  embedded  in  (possibly  non-common)  mixed  perceptions  x. 
Loosely  speaking,  the  common  perception  x  embedded  in  x  will  contain  exactly  the  same 
information  regarding  the  underlying  state  as  the  initial  uncommon  (mixed)  perception 
profile  X.  We  will  then  show  that,  for  any  neutrally  stable  {x,  a)  in  the  meta-game  G,  the 
play  a  is  neutrally  stable  for  the  meta-game  Gx,  where  the  agents'  perceptions  are  fixed 
at  the  mixed  common  perception  x  embedded  in  x.  Proposition  4  proves  that  for  each 
neutrally  stable  (x,  a)  for  G,  and  for  each  realization  n  (w)  of  agents'  perception  profile, 
cr  (tt  («;))  is  a  Nash  equilibrium  of  the  coarsened  game  {N,S,  E[v\x,n  (w)]),  where  agents 
have  a  common  perception  (prior),  and  estimate  their  payoffs  via  Bayes'  rule  using  all 
available  information.  This  will  establish  our  claim  in  the  introduction  that  despite 
different  perceptions,  agents  will  play  as  if  they  share  a  common  perception,  and  will 
choose  some  effectively  rational  behavior  with  limited  information. 
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We  take  aset  V  D  V  with  cardinality  |y  |  —  \V"'\-  The  set  V  will  be  the  image  of  the 
common  perceptions  embedded  in  any  perception  profile  tt.^^  We  also  fix  a  one-to-one 
and  onto  function 

The  function  (/>  will  provide  the  isomorphism  between  the  perception  profiles  and  the 
common  perceptions  embedded  in  them. 

Common  perceptions  embedded  in  perception  profiles     Given  any  perception 

profile  7r  =  (tti,  . . . ,  7r„)  G  H^ ,  we  consider  a  (generalized)  perception  tt  :  W  ^  V  with 

TT  —  (f)  O  TT. 

The  set  of  all  such  (generalized)  perceptions  is  denoted  by  n  =  V^ .  For  each  such 
perception  tt  G  Fl,  we  also  have  the  perception  profile  tt  —  0"^  o  jr.  We  will  say  that  the 
common  perception  tt  is  embedded  in  the  perception  profile  n:  it  contains  exactly  the 
same  information  as  the  unconrmon  perception  profile  tt.  That  is,  at  each  w  G  W,  we 
have 

/*  (w)  =  n-^  {^  {w))  =  n"'  {(/)-'  (0(7r  (w))))  =  tt-\tt  {w))  =  r  (w)  .  (1) 

Hence,  the  information  partitions  generated  by  n  and  n  are  the  same.  Here,  /"^  {lu)  = 
Pljg^  Z""'  {w)  —  {w'Itt,  {w')  =  TTi  {w)yi  G  A''}  is  the  set  of  all  worlds,  w',  that  cannot  be 
distinguished  from  world  w  by  looking  at  the  realization  of  tt. 

The  mixed  common  perceptions  embedded  in  mixed  (uncommon)  perceptions  are 
defined  similarly.  For  each  mixed  perception  x,  we  define  a  mixed  coinmon  perception 
X  on  n  by  setting 

S:{n)  =  Pv{r'{^)\x)  =  l[x{7T,)  (2) 

at  each  tt  G  H,  where  (l)~^  o  tt  =  (tti,  .  . .  ,7r„)  =  tt.  The  mixed  common  perception  x  is 

embedded  in  the  mixed  (uncommon)  perception  x  in  the  sense  that  x  assigns  the  same 

probability  to  the  conunon  perception  tt  =  0  o  tt  embedded  in  each  perception  profile  tt 

as  the  x  assigns  to  the  probability  profile  tt  itself. 

""Note  that  this  theory  is  useful  when  we  approximate  the  contiunuum  with  grid  V,  in  which  case 
|y"|  and  I VI  will  have  the  same  limit. 
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As  was  the  case  with  it  and  it,  x  and  x  generate  the  same  information  partitions. 
Given  any  tt'  —  0o7r',  and  any  w  €  W ^  the  probabihty  that  we  have  a  common  perception 
-n'  given  x  and  tt  [w)  is  the  same  as  the  probabihty  that  we  have  perception  profile  tt' 
given  X  and  tt  [w),  i.e.,  Pr  {tt'\x,  tt  (w))  =  Pr  (7r'|x,  tt  (it;)).  Hence,  by  (1),  we  have 


E[v\i,n{tu)]    =     '^PT{^'\i,n{w))EQ[i!\r  {w)] 

=      J]   Pr  (tt'Ix,  tt  (w))  Eq  [v\r  (w)]  =  E  [v\x,  tt  {w)]  . 


That  is,  the  expected  payoflr  fmiction  given  the  perceived  payoffs  wiU  be  the  same  under 
X  and  X. 

The  next  lemma  clarifies  the  relationship  between  the  meta-game  G,  where  the  per- 
ceptions and  play  evolve  together,  and  the  meta-game  G^,  where  the  play  evolves  under 
the  mixed  common  perception  x. 

Lemma  1   Let  d  :  V  ^  A{S),  }:l  :  V  -^  A  (S),  a  -.V^  -^  A{S),  and  /x  :  V^  ->  A  (5) 
be  such  that  a  =  a  o  cf)  and  n  —  Jio  (f).    Then, 

U''{a,il)  =  U{{x,ao4,),{x,jlo(t)))  =  U  {{x ,  a) ,  {x , /j.)) . 

Proof.  We  have 

U^a,jl)     =     -  J]^^  ^  5]]t'M5,^t;)(5(w)x(^)a,(s„^(u;))]^/ij(Sj,7r(u')) 
sen  'eiv  w^iv  ses  jyi 


=  ^EEEE^M5,-)QH 


n 
TTcn^  2fc/v  wew  ses 


YlxiTT,) 


ieN 


=     U  {{x,  a  o  0)  ,  (x,  /i  o  0))  =  [/  {{x,  a)  ,  (x,  ^i)) 


This  lemma  states  that  the  outcome  of  the  evolution  of  play  in  any  situation  with 
widely  differing  beliefs  and  perceptions  can  be  represented  as  the  outcome  of  evolution  of 
play  in  a  situation  with  common  perceptions.  Perhaps,  the  fact  that  there  is  a  common 
perception  that  captures  the  same  information  as  a  set  of  uncommon  perceptions  is  not 
surprising.  But  this  lemma  will  enable  us  to  next  prove  the  stronger  result  that  strategies 
will  evolve,  and  the  long  run,  precisely  as  if  all  agents  are  using  this  common  embedded 
perception. 
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Evolution  of  perceptions  We  can  now  state  and  prove  the  main  result  of  the  paper, 
which  hnks  the  neutrally  stable  outcomes  of  G  to  neutrally  stable  outcomes  of  the  com- 
mon perception  game  G^,  and  to  Nash  equilibria  of  a  coarsened  version  of  the  underlying 
game  (A'^,  5,  E  [v\x,  tt  {w)]).  We  write  Pr  (7r|a:)  for  the  probability  of  a  perception  profile 
TT  under  a  mixed  perception  x,  i.e.,  Pr  (7r|x)  —  YlieN-"^  (^«)- 

Proposition. 5  For  every  neutrally  stable  (x, a)  for  G,  the  play  a  =  ao (f)~^  is  neutrally 
stable  for  d,  where  x  is  the  mixed  common  perception  defined  by  (2).  Therefore,  given 
any  tt  with  Pr(7r|x)  >  0,  and  any  w  G  W,  a  {n  {w})  is  a  Nash  equilibrium  of  game 
{N ,  S ,  E  [v\x ,  TT  (w)]) ,  where  it  =  0  o  tt.  Therefore,  a  o  n  is  effectively  rational  with 
limited  information. 

Proof.  Take  any  neutrally  stable  {x,a)  for  G.  Take  any  fi  :  V  ^  ^{S),  and  let 
a  —  a o (f)~^  and  p,  —  fiocf).  Since  (x, a)  is  neutrally  stable  for  G,  there  exists  some  e  >  0 
such  that  for  each  e  €  (0,e), 

U  ((x,  a) ,  (x,  e/i  +  (1  -  e)  a))  >  U  ((x,  p) ,  (x,  ep+{l-  e)  a)} . 

But,  by  Lemma  1,  we  have 

[/*(CT,e/i+ (1 -e)a)     -     [/ ((x,a  o  (/>)  ,  (x,  e/i  o  0  +  (1  -  e)  a  o  0)) 

=     U{{x,a),{x,ep  +  {l-e)a)) 

and  U"^  {p,  ejl  +  {I  —  e)  a)  —  U  ((x,  p)  ,  (x,  e//  +  (1  —  e)  cr)).  Therefore,  for  each  e  e  (0,  e), 

U^  (a,  e^  +  (1  -  e)  a)  >  U^  {~p,  e/i  +  (1  -  e)  a) , 

showing  that  a  is  neutrally  stable  for  Gf. 

The  second  statement  follows  from  Proposition  4:  since  a  ~  a  o  <p~^  is  neutrally 
stable  for  G^,  by  Proposition  4  (for  G^),  a  {tt  {w))  —  a  {tt  {w))  is  a  Nash  equihbrium  of 
(A'',  5",  E  [v\x,  TT  (w)])  at  each  w  G  W  whenever  .x  (tt)  =  Pr  (7r|x)  >  0.  Effective  rationality 
of  cr  o  7r  again  immediately  follows  from  the  Nash  equilibrium  result.    ■ 

This  proposition  states  that  evolution  of  play  will  ensure  the  emergence  of  behavior 
corresponding  to  all  agents  having  common  perceptions  and  playing  the  Nash  equilib- 
rium behavior  of  some  underlying  game.  That  is,  evolution  will  lead  agents  to  play  as 
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if  they  have  a  common  prior  and  are  effectively  rational  (in  fact,  they  play  an  equi- 
librium). This  is  despite  the  fact  that  each  agent  may  hold  different  mlsperceptions 
and  act  "irrationally"  given  their  perceptions.  Their  behavior  will  effectively  undo  these 
perception  differences,  leading  to  effectively  rational  behavior  and  to  a  situation  with 
effectively  common  perceptions. 

Notice  that  the  proposition  maps  the  neutrally  stable  outcomes  of  evolution  to  the 
Nash  ecjuilibria  of  some  coarsened  game,  yielding  effectively  rational  behavior  with  hm- 
ited  information.  This  may  not,  however,  correspond  to  the  equilibrium  of  the  full- 
information  game,  because  some  suppression  of  information  may  take  place  (hence  our 
emphasis  on  limited  information).  This  is  because  all  agents  may  develop  mispercep- 
tions  that  suppress  information  that  would  reduce  the  payoff  to  all  parties,  destroying 
some  "insurance"  opportunities — as  in  Game  3  of  Section  2.  We  will  now  present  such 
a  neutrally  stable  outcome  of  this  game  in  more  detail. 

Excimple  2:  Suppression  of  information  Consider  the  same  example  as  in  Game 
3  in  the  introduction,  with  true  payoff  function  v  for  this  game  is 

1\2 


L 

R 

3,3 

-1,1 

u',-5 

0,0 

where  x  can  take  values  0  and  5  with  equal  probabilities.  We  take  W  —  {wi  —  0,  uh  =  5} 
with  Q  {wi)  —  Q  {W2)  =  1/2,  where  w  takes  0  at  wi  and  5  at  'W2-  We  take  V  = 
{v{-;wi)  ,v  [■■,W2)  tV]  as  the  set  of  possible  payoff  functions,  where  v  —  ({)(•;  Wi)  + 
V  (•;  W2))/2,  which  corresponds,  in  this  case,  to  w  =  (0  +  5)  /2  =  2.5.  We  consider  the 
pure  perception-play  pair  (tt ,  a)  where  vf  (u;)  =  v  at  each  w  G  W  and 

-7/  \        f  (1,0)     if  f  1  =  i;2  =  v, 

'''*'"''^'={(0:i)    otherwise 

for  each  i  ^  N.  Here,  the  first  entry  is  the  probability  that  a'  assign  to  the  strategy  1 
or  L  when  he  plays  rows  or  columns,  respectively.  According  to  this  strategy,  a  player 
plays  1  (or  L)  if  each  player  perceives  the  game  as  the  average  game,  and  plays  r  (or  R) 
otherwise.  When  everyone  has  perception- play  pair  {it,  a),  each  player  gets  3. 

The  only  way  a  mutant  may  have  a  higher  payoff  is  that  when  he  is  the  row  player 
he  chooses  r  at  1^2,  and  the  column  player  continues  to  play  L,  giving  him  a  payoff  of  5. 
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Strategy  &"■  (^1,^2)  dictates  that  the  column  player  will  play  L  only  if  its  opponent,  the 
mutant  in  this  case,  perceives  the  payoff  function  at  W2  as  v.  In  that  case,  our  mutant 
must  play  r  at  {y,  v).  At  w\,  if  the  mutant's  perception  differs  from  v,  the  column  player 
will  play  R,  in  which  case  the  highest  our  mutant  gets  is  0  (when  he  plays  r).  Hence,  if 
he  distinguishes  W\  and  W2-,  the  highest  our  mutant  can  get  is  (0.5)(5)+(0.5)(0)  =  2.5, 
lower  than  his  payoff  of  3  under  (tt,  a). 

If  the  mutant  does  not  distinguish  Wi  and  W2,  he  must  perceive  the  payoff  function 
as  v  at  wi,  too.  In  that  case,  the  profile  of  perceived  payoff  functions  will  always  be 
(vjC*),  hence  the  incumbents  will  always  play  L,  and  the  mutant  will  always  play  r  and 
again  obtain  (0.5)(5)+(0.5)(0)  =  2.5,  lower  than  3.  In  either  case,  a  mutant  gets  strictly 
lower  payoff,  unless  his  perception  function  is  tx  and  he  plays  1  at  {y,  v).  Now  we  need  to 
check  whether  these  mutants,  who  do  as  well  as  incumbents  against  incumbents,  cannot 
do  any  better  when  they  meet  each  other.  But,  when  any  two  such  mutants  meet,  each 
will  get  exactly  3,  not  higher  than  the  incumbents'  payoff.  Therefore,  [it,  a)  is  neutrally 
stable. 

This  example  shows  that  some  Nash  equilibrium  of  a  coarsened  game  may  be  neu- 
trally stable  (even  when  it  does  not  correspond  to  a  Nash  equilibrium  of  the  full- 
information  game).  Not  all  Nash  equilibria  of  all  coarsened  games  can  be  neutrally 
stable  outcomes,  however.  For  example.  Proposition  2  shows  that  in  one-person  games, 
each  neutrally  stable  outcome  leads  to  effectively  rational  behavior  with  full  informa- 
tion, ruling  out  all  non-trivial  coarsenings.  We  next  investigate  this  issue  in  games  with 
strategic  interactions. 

4.4     A  further  restriction 

In  deriving  Proposition  5,  we  considered  only  mutations  of  play,  and  did  not  consider 
mutations  of  perceptions.  We  will  now  consider  mutations  of  both  perceptions  and  play, 
and  derive  a  stronger  restriction  on  the  set  of  neutrally  stable  outcomes. 
Let  us  write 


for  the  expected  value  of  having  perception  x  and  playing  a,  while  all  the  other  agents 
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do  the  same,  given  that  the  underlying  world  is  w.  We  also  define 

IT  =  -Y^  min max  ^  v'  (s,  w*)  ]^  Gj  (s,) 

i&N      "'         '      s£S  jeN 

as  the  minimum  expected  utility  an  agent  gets  if  he  recognizes  that  the  underlying  world 
is  w  and  gives  his  best  response.  Finally,  write 

Vj:  =  {v  =  TT  (w)  \x  (tt)  >  0,  w  e  W} 

for  the  set  of  payoff  functions  perceived  with  positive  probability  under  x. 

Proposition  6  Let  {x,  a)  be  neutrally  stable  for  G.  If  Vx  ^  V ,  then 

at  each  w  ^W . 

Proof.  Assume  V^  ^V  and  take  any  v  G  V\Vi.  To  derive  a  contradiction  consider 
a  neutrally  stable  (x,  a)  such  that  IP"  >  U^'  {x,a)  for  some  w*  G  W .  Consider  the 
following  mutant  (x,(t): 

1.  he  recognizes  w*:  x  (tt)  =  x  (tt)  for  each  n  and  tt  with  x  (tt)  >  0,  tt  {w*)  =  v,  and 
7f  [w]  =  n  [w)  whenever  w  ^  w*;  and 

2.  he  gives  his  best  response  at  w*:   for  each  i  &  N,  when  Vi  —  v,  a,  (I'l, . . .  ,u„)  G 

arg  max„,  Ylses  ^'  (-5'  ^^)  c^i  (■s^)  FIj^z  -^j  (sj,  vi,  • . . ,  Vn),  and  a^  (t'l, . . . ,  t'„)  =  a,  (I'l, . . . ,  t'„) 
otherwise. 

When  w  ^  w*,  the  mutant  (x,  a)  does  as  well  as  the  incumbents.  We  will  now  show 
that  he  will  do  strictly  better  than  the  incumbents  at  w*.  At  iv* ,  the  mutant  gets 


-  ^     ^     Pr  {v^^\x,  w*)  max  ^  v'  (s,  w*)  a^  [s,)  J]^  Oj  {Sj,  {v,  i'_,)) 
>     —  >^     \^     Pr  (v_j|x,  w*)  min  max  N^i)^  (s,  u'*)  TT  cTj  (sj)  . 

-  J2  n^in  iTiax  5]]  v'  (s,  w*)  n  ^J  (5j)  =  ^'^'  >  ^"'*  O^'  '^) ' 


n 

i6Af  ses  jeN 
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where  {v,  i;_,)  is  the  vector  of  perceived  payoff  functions  where  i  and  any  other  j  perceive 
the  payoff  function  as  v  and  Vj,  respectively,  and  Pr  {v_^\x,w*)  is  the  probabihty  of 
{v,  V-i)  given  that  the  incumbents  have  mixed  perceptions  x  and  the  underlying  world  is 
lu* .  (Note  that  X^„_  gyn-i  Pr  {v_i\x,  w*)  —  1;  and  at  w* ,  each  incumbent  gets  V"'  (x,  a).) 
This  establishes  that  his  expected  payoff  will  be  strictly  better  than  the  incumbents', 
contradicting  our  hypothesis  that  {x,a)  is  neutrally  stable  for  G.    ■ 

This  proposition  shows  that  in  any  Neutrally  stable  outcome  an  individual  cannot 
increase  his  payoff  by  introducing  more  information.  This  restricts  the  set  of  coarsened 
games  whose  Nash  equilibria  can  correspond  to  Neutrally  stable  behavior. 

5      Evolution  When  Perceptions  Are  Not  Observable 

We  have  so  far  limited  the  analysis  to  situations  in  which  perceptions  are  observable. 
The  previous  literature  has  also  investigated  situations  in  which  agents'  preferences  may 
be  only  imperfectly  observable  (e.g.,  Dekel,  Ely  and  Yilankaya,  1998,  Ely  and  Yilankaya, 
1999,  and  Ok  and  Vega-Rodondo,  2000).  We  now  extend  our  analysis  to  this  case. 

Suppose  that  in  addition  to  the  observable  perceived  payoff  function  7r(-,u;),  each 
agent  also  has  a  private  perception  function  n^.  As  a  result,  each  agent  will  observe  his 
own  Ti-P  {w),  and  can  condition  his  behavior  on  this  perception,  but  he  will  not  observe  the 
private  perception  of  other  players.  We  can  incorporate  this  into  our  analysis  by  simply 
redefining  the  play  function  to  have  a  larger  domain,  a  :  1/^+^  -^  A  (S).  Similarly,  we 
define  a  perception  profile  for  player  in  role  j,  tTj,  as  tZj—  (tt^,  tti,  ...,  Tr^).^^  This  implies 
that  each  agent  can  condition  his  play  on  all  observed  perceptions  and  his  own  private 
perception.  We  can  now  state 

Proposition  7  Given  any  neutrally  stable  (£,  a)  for  G  and  any  tc  with  x{Tt)  >  0, 
a  (tt  (■»;))  is  a  Nash  equilibrium  of  the  underlying  game  {N,  S,v  {■,  lu))  for  each  w  G  W . 
Therefore,  a  o  n  is  effectively  rational  with  full  infoimation. 

Proof.  An  immediate  generalization  of  Proposition  5  implies  that  all  neutrally 
stable  outcomes  correspond  to  Nash  equilibria  of  some  coarsened  game.  We  only  have 


''  It  will  turn  out  to  be  redundant  that  (7rj, ...,  tTh)  are  observable,  given  that  tt^  is  private  information. 
Nevertheless,  this  more  general  form  highlights  that  adding  a  flexible  piece  of  private  information  to 
our  setup  so  far  removes  many  of  the  neutrally  stable  outcomes  of  Proposition  5. 
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to  rule  out  Nash  equilibria  of  these  coarsened  games  that  are  not  Nash  equilibria  of  the 
underlying  game  (A^,  S,v  {■,w)).  Suppose  that  (x,  a)  is  neutrally  stable,  and  a  [n  {w"}) 
is  a  Nash  equilibrium  of  a  game  {N,  S,  Eq  \v\x,  tt  (w*)]),  but  not  a  Nash  equilibriimi  of 
{N,S,v  {■,w*))  for  some  w*  G  W.  This  imphes  that  there  exist  a  role  j,  and  a  strategy 


s'j  such  that 


y  {s'^,a_j{7r{w)),w)  >  Eq  [v'\x,7t{w)] 


Now  consider  a  mutant  with  tt^  =  yr*,  where  vr*  is  the  accurate  perception.  This  devi- 
ation is  not  observed  by  other  players,  so  their  strategies  are  still  given  by  a^j  (tt  {w)). 
This  then  enables  the  mutant  to  choose  s'j  when  he  plays  the  role  j  at  w* ,  increasing 
his  fitness.  Hence,  (x,  a)  is  not  neutrally  stable.  Since  a  {n  [w))  is  a  Nash  equilibrium  of 
the  full  information  game,  it  rules  out  the  play  of  all  dominated  strategies,  hence  a  o  tt 
is  effectively  rational  with  full  information.    ■ 

Therefore,  once  we  allow  perceptions  not  to  be  observed,  we  obtain  a  stronger  propo- 
sition than  Proposition  6:  now  only  Nash  equilibria  of  the  underlying  game  are  candidate 
neutrally  stable  outcomes.  This  is  intuitive,  since  starting  from  any  outcome  that  is  a 
Nash  equilibrium  of  some  coarsened  game,  but  not  a  Nash  equilibrium  of  the  underlying 
game,  a  player  can  take  a  deviation  to  expand  his  information  set  and  increase  his  payoff. 

6     Concluding  Remarks 

There  are  many  examples  of  potentially  irrational  behavior  that  may  have  evolutionary 
benefits  by  acting  as  a  commitment  device.  For  example,  the  tendency  to  seek  revenge 
may  be  a  useful  commitment  to  punish  those  who  break  their  promises.  In  this  paper 
we  argue  that  a  key  link  in  this  reasoning,  that  of  subjective  rationality,  does  not  have 
strong  evolutionary  foundations,  undermining  much  of  the  appeal  of  this  argimient. 
Subjective  rationality  requires  that  agents  always  choose  actions  that  maximize  their 
perceived  payoffs.  Although  this  appears  almost  tautological,  we  show  that  when  there 
are  misperceptions,  subjectively  irrational  agents  can  have  greater  evolutionary  success 
than  subjectively  rational  agents. 

Once  we  allow  mutations  to  lead  to  subjective  irrationality  as  well  as  mispercep- 
tions, we  find  that  selection  will  lead  to  effectively  rational  behavior,  albeit  with  limited 
information:    perceptions  or  preferences  act  as  commitment  devices  only  when  they 
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are  informative  about  future  behavior,  and  relaxing  subjective  rationality  breaks  this 
link.  Thus  somewhat  paradoxically,  subjective  irrationality  creates  stronger  evolution- 
ary forces  towards  "rational  behavior" . 

Interestingly,  there  is  no  immediate  link  between  rational  behavior  and  accurate 
perceptions.  Evolution  will  select  agents  who  will  act  "rationally" ,  but  these  agents  will 
have  very  different  perceptions,  and  sometimes  will  make  systematic  mistakes  (yet  their 
mistakes  ■will  cancel  each  other).  This  reasoning  suggests  that  systematic  misperceptions 
in  experimental  settings  or  in  situations  with  little  relevance  to  long-run  fitness  do  not 
necessarily  translate  into  widespread  "irrational"  behavior. 

We  also  show  that  even  though,  in  an  "evolutionary  equilibrimn,"  agents  will  have 
very  different  perceptions,  they  will  possess  effectively  common  perceptions  of  the  game 
and  the  payoffs,  in  the  sense  that  they  will  play  as  if  they  have  a  common  prior. 
This  result  is  of  interest  since  there  is  disagreement  over  whether  the  common  prior 
assumption  in  game  theory  has  good  theoretical  foundations.  Our  analysis  suggests 
that  this  assumption  may  have  good  evolutionary  foundations. 

It  is  important  to  emphasize  that  our  analysis  does  not  imply  that  agents  will  be- 
have "rationally"  in  all  situations.  First,  according  to  our  results,  when  perceptions  are 
observable,  behavior  will  be  effectively  rational  only  with  limited  information,  and  may 
deviate  from  the  Nash  equilibrium  of  the  full-information  game.  This  is  because  sup- 
pression of  information  may  be  mutually  beneficial  for  all  agents.  Second,  our  analysis 
has  been  limited  to  the  case  where  there  are  no  cognitive  restrictions  on  behavior,  and 
therefore  should  be  interpreted  as  implying  that  there  will  not  be  systematic  biases  given 
the  cognitive  resources  available  to  the  agents.  This  does  not  rule  out  boimdedly  ratio- 
nal behavior  because  of  cognitive  limitations.  Finally,  as  with  all  evolutionary  analyses, 
our  results  apply  in  the  "long  run",  and  adjustment,  especially  after  important  changes 
in  environments,  may  take  a  long  time,  during  which  behavior  that  is  not  neutrally  or 
evolutionarily  stable  can  be  observed. 
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